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Closed form expressions of approximate lower confidence limits for 
P(X>y ) and P(X>Y ) are presented where X and Y are independent and 
have normal probability distributions with unknown means and 
variances. Each of the three expressions requires values of percentile 
points from the standard t distribution and values of the standard 
normal cumulative distribution function to compute the lower 
confidence limit using a data set. The expressions are shown to be quite 
accurate for sample sizes of ten or larger. 
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1. INTRODUCTION 

Throughout this paper X and Y are independent normally distributed 

2 

variables with unknown means ji x and p y and unknown variances c x and 
2 

<7y. 0(z) is the standard normal CDF, and f tt/n is the 100(l-a)th percentile 

point of the standard t distribution with n degrees of freedom. The symbol 

t n (p) denotes a noncentral t variable with n degrees of freedom and 

2 

noncentrality parameter p. The terms X and S x are the sample mean and 

2 

sample variance of a random sample of size n on X. The terms Y and S y are 
the sample mean and sample variance of a random sample of size m on Y. 
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.2 2 2 

Sp = l(n-l)S x + (m-l)S y j / (n+m-2) and x, y, s x , s y , s p denote observed values 
of the corresponding random variables. 

If the random strength, X, of a device must exceed some worst case stress 
value y, the device reliability is often modeled as P(X>y). If the stress variable, 
Y, is also random the device reliability is frequently modeled as P(X>Y) which 
is the average of P(X>y ) with respect to the distribution of Y when X and Y are 
independent. In this paper we present closed expressions for approximate 
lower confidence limits for P(X>y ) and P(X>Y). In the latter case one 
expression is provided assuming o x = < 7 y and a different expression is given 
assuming o x * cr y . 

Several papers in the literature present the minimum variance unbiased 
estimator for P{X>y) when p x and a x are unknown. See, for example, 
Lieberman and Resnikoff (5), Folks and others (4), and Barton (1). Tables of 
exact interval estimates for P(X>y) were developed by Owen and Hua (6) for 
confidence levels of 90% and 95%. The exact procedure requires a search in 
tables of the non-central t distribution. Owen and Hua have performed this 
task and developed more convenient tables. One enters their tables using 
confidence level 1 -a, sample size n and sample statistic value, ( x-y)/s = k to 
obtain the lower confidence limit for P(X>y). The lower limit values obtained 
using our expression agree with their values to the nearest hundredth 
decimal point or better for sample sizes of ten or larger and 1 < k < 5. Access 
to values of t a >m and values of <T>(z) are required to use our closed form 
procedure which is also a function of a, n and k. 

If o x = Cy in the two variable case, our lower confidence limit expression 
for P(X>Y) is analogous to the single variate case. Values of f aj „ and &{z) are 
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needed to use this procedure which is a function of sample sizes n x , n y and 
( X-y ) / S p . 

Approximate confidence limits for P(X>Y) have been developed by 
Church and Harris (2) when Y has a standard normal distribution. Downton 
(3) modified their procedure slightly to obtain more accurate limits and 
suggests an approximate procedure when the means and variances of both X 
and Y are unknown. 



2. SUMMARY OF RESULTS 

For the single variable case we seek a lower 100(l-a)% confidence limit 
for P(X>y ) = R(y). Let S = (ii x -y)/c t x , then R(y) = 0(5), and a lower confidence 
limit R(y)i, for R(y) is given by 

m L = f(h) (i) 



where 5l is a lower confidence limit for 5. 

Owen and Hua (6) used the general confidence interval procedure to find 
5l ■ Specifically 5l is the value of 5 for which 



l-a = p[(X-y)/S<(T-y)/s] 
= P^ n -\({P X ~ y)Jn /<J X )< 



(2) 



Letting k = (x-y)/s, Owen and Hua develop tables of 0(50 for many values of 
k in [-3, 6], and sample sizes n = 2(1)18, 21(3)30, 40(20)100 and 1 -a = .90 and .95. 
The corresponding values of 5i can easily be obtained from 0~ l (0(5O)- 

t 

In Section 3 of this paper an approximate lower confidence limit, 5 L , for 5 
is shown to be 
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( 3 ) 



S L =k- 



1 k 2 l*, 

n + 2(«-l)J a ' n - ! 



Values of S L were compared with corresponding values of Sl from Owen and 
Hua's tables for numerous sets of ( n,k ) to obtain a more accurate expression, 
S L . Let 



[60 



N(a,n) = < 



n 

n 



if a =. 05, 
if a =.05, 
if a =.10, 



10 <n <60 
n > 60 
«>10 



( 4 ) 



S' L = k- 



1 k 2 \2 
n + 2(n _ l)J a ' N {a ’ n) 



( 5 ) 



The corresponding approximate 100(l-a)% lower confidence limit is 



r'( v ) l = 0(s' l ). 



( 6 ) 



Table 1 displays values of and those given by Owen and Hua, Rl- 

For the two variate case, we seek a lower 100(l-a)% confidence limit for 
P(X>Y). We first assume o x = try = a Let R = P(X>Y) and d = (ji x - f. iy)/o . then 
R = Oid/^fl) and a lower confidence limit, Rl, for R is 

Rl = <PC4/V 2) (7) 



It is shown in Section 3 that a lower 100(l-a)% confidence limit, di, for d is 
the value of d for which 



1 - a = P 



*((/*X -Hy)/ W0 / n) + (1 / m)) < (x - y) / spj(l 1 n) + (1 / m)] (8) 
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TABLE 1. APPROXIMATE (R L ) AND EXACT (R L ) CONFIDENCE LIMITS 
FOR P(X>y) 





a = .10 


ft 

II 

b 

cn 


n k 


Rl 


Rl 


Rl 


Rl 


10 


1 


.6768 


.6816 


.6334 


.6305 


2 


.8890 


.8913 


.8535 


.8519 


3 


.9736 


.9745 


.9560 


.9556 


4 


.9958 


.9960 


.9903 


.9903 


5 


.9996 


.9996 


.9985 


.9985 


18 


1 


.7298 


.7303 


.6460 


.6950 


2 


.9260 


.9257 


.9040 


.9035 


3 


.9877 


.9875 


.9800 


.9800 


4 


.9988 


.9988 


.9973 


.9973 


5 


.9999 


.9999 


.9998 


.9998 


30 


1 


.7597 


.7594 


.7337 


.7336 


2 


.9431 


.9426 


.9286 


.9285 


3 


.9925 


.9923 


.9885 


.9885 


4 


.9995 


.9994 


.9989 


.9989 


5 


.99998 


.99998 


.99994 


.99994 


60 


1 


.7865 


.7861 


.7688 


.7691 


2 


.9562 


.9559 


.9478 


.9480 


3 


.9954 


.9953 


.9936 


.9937 


4 


.9998 


.9998 


.9996 


.9996 


5 


.99999 


.99999 


.99999 


.99999 



/ 

Alternatively, an approximate lower confidence limit d La using a Taylor 
series expansion of ( x-y)/s p in the manner described in Section 3 is 
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\ 

d La =K ~ ((1 /«) + (!/ m ) +K 2 /2(n + m- 2))2 t a#ffl+n _ 2 (9) 

where X = ( x-y)/s p . Yang (8) used Monte Carlo simulations to evaluate the 
✓ , 

accuracy of R La = 0{d L J-^2) for a = .20, .10, and .05. One thousand 
/ 

replications of R La were generated for each parameter set (cr, p x , Py, m, n). 

Values of 1 and 20 were chosen for a then values of fi x and selected so that 

R = .90, .95 and .99. For each of these six parameter sets, three pairs of sample 

sizes were chosen for a total of eighteen parameter sets for each value of a. 

/ 

The results of the simulations are given in Table 2. If R Lq is an exact 



TABLE 2. ANALYSIS OF APPROXIMATE CONFIDENCE LIMITS FOR 
P(X>Y): EQUAL VARIANCES CASE 



R 


n 


m 


/ 

a = .20 P a = -05 

a = .10 


.900 


8 


8 


.8989, .801 


.8944, .906 


.8884, .960 


8 


30 


.9003, .798 


.9050, .888 


.8987, .952 


20 


30 


.9019, .788 


.9034, .888 


.9015, .947 


.950 


8 


8 


.9498, .801 


.9467, .909 


.9406, .962 


8 


30 


.9511, .791 


.9529, .886 


.9489, .951 


20 


30 


.9516, .785 


.9517, .889 


.9500, .950 


.990 


8 


8 


.9901, .797 


.9886, .911 


.9865, .962 


8 


30 


.9906, .783 


.9909, .881 


.9900, .950 


20 


30 


.9904, .789 


.9904, .895 


.9906, .945 



procedure, the values in the column labeled ^ L100 o(i-a; s ^ ou ^ equal the 
values of R in the same row. The symbol p denotes the proportion of the 
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1000 lower confidence limits that covered the value of R. The results were 
the same for <7=1 and o = 20. 

2 2 V2 

If we assume o x *o y> let R = P(X>Y) and r = (/J. x -P v ) / (o x + o y ) , then 
R = <p(r) and a lower 100(l-a)% confidence limit Ri for R is 

R l = 0(r L ) (10) 

where ri is a lower confidence limit for r. Let f be defined by 

1 

r = (*-y)/( s ? +s y ) 2 ( n ) 

The general method cannot be used to find a lower confidence limit for r, 
because the statistic r cannot be modified to an equivalent statistic whose 
distribution is known with r as the only unknown parameter. This is the 
same difficulty encountered in the well-known Beherns-Fisher problem 
associated with finding confidence intervals for fi x -^L y . However an 
approximate lower confidence limit ri for r can be found by using a Taylor 
series expansion of f and fitting a t distribution to the distribution of f using 
random degrees of freedom, v, which is computed from the data. This 
procedure is developed in Section 3. The expression forri, a is 

l 

*a,v 

( 12 ) 

where 

u = (s 2 +Sy) /(s x /(n-l) + Sy /(m-1)) (13) 

The method used in this procedure to resolve the Beherns-Fisher type of 
difficulty is similar to that employed by Welch (7). The expression for v in 
equation (11) is different from that used by Welch. 



r La = r 



(s 2 /n + Sy /m)/(s 2 +s 2 ) + f 2 (s^ /(«-!) + sj /(m-l))/2(s 2 +s 2 j ? 
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Yang (8) has used Monte Carlo simulations to evaluate the accuracy of 
this procedure given by Equations (8), (9), (10) and (11). The results of his 
simulations are given in Table 3. The symbol p denotes the proportion of the 
1000 randomly generated lower confidence limits that covered the value of R. 



TABLE 3. ANALYSIS OF APPROXIMATE CONFIDENCE LIMITS FOR 
P(X>Y) FOR UNEQUAL VARIANCES CASE 



R 


Ox 


oy 


ft 


m 


1000(l-a)/P 

a =.20 a = .10 a = .05 


.900 


1.0 


2.0 


10 


20 


.9008, .796 


.9008, .897 


.8959, .956 


25 


35 


.8996, .803 


.8993, .901 


.8988, .963 


75 


50 


.8997, .801 


.8990, .907 


.8998, .950 


10.0 


40.0 


10 


20 


.9019, .791 


.9022, .889 


.9009, .947 


25 


35 


.9012, .790 


.9005, .897 


.8980, .954 


75 


50 


.9000, .800 


.8995, .902 


.8992, .952 


.950 


1.0 


2.0 


10 


20 


.9500, .800 


.9497, .901 


.9461, .955 


25 


35 


.9502, .798 


.9502, .899 


.9482, .959 


75 


50 


.9496, .808 


.9493, .903 


.9500, .948 


10.0 


40.0 


10 


20 


.9513, .784 


.9510, .896 


.9517, .945 


25 


35 


.9514, .782 


.9494, .902 


.9470, .955 


75 


50 


.9487, .810 


.9494, .904 


.9500, .950 


.990 


1.0 


2.0 


10 


20 


.9902, .793 


.9899, .906 


.9888, .955 


25 


35 


.9903, .789 


.9899, .901 


.9891, .961 


75 


50 


.9898, .811 


.9899, .905 


.9899, .952 


10.0 


40.0 


10 


20 


.9906, .777 


.9901, .898 


.9903, .942 


25 


35 


.9907, .775 


.9904, .893 


.9892, .955 


75 


50 


.9899, .803 


.9898, .906 


.9900, .949 
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3. ANALYSIS 



For the single variate case we fit a t distribution to the distribution of the 
statistic (x-y)/s x . If g(x,s) = ( x-y)/s x = k is expanded in a Taylor series about 
( fi x ,o x ), and the subscript x is dropped, then 

g(x,s) = (/x - y) / s + (x - n) / a- (s - <r)(/x - y) / s 2 . 

Then 

E(k) = E[g(J, s)] = (fi - y)(4 - 3^1-1/ (2n — 1)) / a 

= (ji-y)/a if n > 8, (14) 

and 

2 

<*k = var [g(^s)] - var(x) / g 2 + ((/x - y) / y 2 ) var(s) 

= (1 / n) + ((/x - y) / <r) 2 / 2(n - 1) (15) 

2 

Let djt = (1/xi) + k 2 /2(n-l). It is easily shown that k is a consistent 
estimator for (fj. - y) / a = 8 and for large n the distribution of k will be 
approximately normal. Consequently an approximate lower confidence limit 
for (/i-y)/(j would be k - 6 k Z a . We choose instead to approximate the 
distribution of k with a central t distribution with n - 1 degrees of freedom 
and thus obtain the approximate lower 100(1 - a)% confidence limit, dux, for 
8 as 

S La =k-((l/n) + k 2 / 2 (n - 1))2 (16) 
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As discussed in Section 2, the tables developed by Owen and Hua can be used 

to modify the right member of Equation (12) to obtain the more accurate 

* 

lower confidence limit S La given by 

\ 

6l a =k-{il/n) + k 2 /2(n- l))2 t a N{a n) (17) 

where N(a,n) is defined in Equation (4). 

In the two variable cases when o x - Oy, we fit a t distribution to the 
distribution of the statistic ( x - y)/s p . Expanding g(x-y, s p ) = (x-y)/s p = K in a 
Taylor series about ( jj. x - fXy,d) and collecting terms we get 

^ = ^-y^p) = [(/z x -Az y )/a] + [(x-y-(/i x -/iy))/a]-(s2-a 2 )(^-/i y )/ 2a 3 
Then 

E(K) = — — — (18) 

G 

var (K) = ((1 / n)) + (1 / mj) + (fJ. x ~Hyf / 2 o 2 (n + m - 2) (19) 

Let 

l 

a K = [{l / n) + {l / m) + K 2 / 2{n + m-2)Y (20) 

Proceeding in a manner similar to the single variate case we obtain the 
approximate lower confidence limit, di ia/ for d = ( n x -fiy)/cj given by 

<<La = K - (('/ f.) + (1 / m) + fC 2 / 2(n + m - 2))2 l 0 „ tm . 2 (21) 

and 

Rta = «*(4 o/V2) (22) 
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When o x *o y we fit a t distribution to the distribution of the statistic 

2 2 1/2 2 2 
f = (x-y)/(s x +s y ) . Expanding f in a Taylor series about {ji x - \i y , a x +<7 y ) and 

collecting terms we get 



Consequently 



w-\ • Vi-Vy 

£(r)= ^? 



(23) 



2 3 

var(f) = var(T - y ) / (o? + <Jy ) + (// x - /x y ) var(s* + sj) / 4(o? + o£) 

= ((<*? / n + / m) / (<J 2 + <t^ )) + (/i x - M y f (<t£ / (« - 1) + <rj / (m - 1)) / 2 ( 0 * + <r y )" 

(24) 

Let 



<r? = 



(s?/» + s^/m)/(s^+s2) + (x-y) 2 (s^ /(n-l) + sj /(m-l)/2(s? + s£)' 



(25) 



An approximate 100(l-ct)% lower confidence limit, r^, for 
(^x-A/)/(^ + o-y) 1/2 sr is 

*La = r-Gft a/V (26) 



where v is some appropriate degrees of freedom. Then 

Rlcx = <Xr La ) 



(27) 



The following method for finding an "appropriate" degrees of freedom is 
used only to develop an expression for the degrees of freedom, o, that makes 
equation (23) sufficiently accurate. If a strategy similar to that used in the two 
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previous interval methods is used to find an approximate confidence limit 
for r, we need to find the approximate degrees of freedom for an equivalent 
expression for f which looks something like a noncentral f distribution. Yang 
[9] used a different approach to the following procedure, but obtained the 
same result. The expression 



suggests, but is not, a noncentral t distribution with noncentrality parameter 
r. If we were to use a t distribution to fit the distribution of r, the radicand 




(28) 



2 2 2 2 2 
(s* + s y )/{G x + Oy) should be of the form xjc. Therefore 




(29) 



Using the properties var^j = 2c,var|s*j = lOy / (n - l),var|syj = 2Cy / (m-1) 
and taking the variance of both sides of equation (26), we have 




Solving for c we get 




Finally we let 




(30) 
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4. CONCLUSIONS 



We have derived closed form expressions for approximate lower 
confidence limits for the reliability of a device when reliability is modeled as 
P(X>y) or P(X>Y) where X and Y are independent normally distributed 
variables with unknown means and variances. The expressions have been 
evaluated for accuracy and demonstrated to be quite accurate when sample 
sizes are larger than ten. The three confidence limit expressions presented are 
easy to compute and can be programmed on some existing hand-held 
calculators. Percentile values, f 0/ „, of the standard t distribution and values of 
the standardize normal cumulative distribution function, &(z), are required 
for each expression to compute the lower confidence limit values for given 
data sets. 

In mechanical reliability settings, X usually denotes strength and Y 
denotes stress. However both reliability models, P(X>Y) and P(X>y), have 
numerous applications when X and Y denote times to first occurrence of 
events. 
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